Tone Quality Improvement of Bone Conduction Voice by Cepstrum-based Local Conversion Models
Authors
Abstract
A novel tone quality improvement method for bone conduction voice is presented. In the present method, the tone quality of the bone conduction voice is converted to a quality similar to that of the air conduction voice. For the voice conversion, the method uses a codebook consisting of paired code vectors of the bone and air conduction voices. The delta- and mel-cepstral coefficients are employed as the code vectors. The delta-cepstral coefficients in the code vectors are first quantized and classified by a neural-gas network. The relationship between the mel-cepstral coefficients of the bone and air conduction voices in each class is described locally by a mathematical conversion model. The bone conduction voice is then converted into a clear air conduction voice by using these local models. The validity and effectiveness of the method have been confirmed by applying it to the tone quality conversion of real bone conduction voice.
Key-Words: Bone conduction voice, Delta-cepstrum, Neural-gas network, Local conversion model
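The pipeline described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the form of the local conversion model, so an affine least-squares map per class is assumed here, and all function names, array shapes, and hyperparameters (`n_units`, `eps`, `lam`) are hypothetical. Inputs are assumed to be time-aligned frame matrices of delta-cepstra and mel-cepstra.

```python
import numpy as np

def train_neural_gas(data, n_units=4, n_epochs=20, eps=(0.5, 0.01), lam=(2.0, 0.1), seed=0):
    """Neural-gas vector quantization: every unit moves toward each sample,
    weighted by its distance rank. Hyperparameters are illustrative."""
    data = np.asarray(data, dtype=float)
    rng = np.random.default_rng(seed)
    units = data[rng.choice(len(data), n_units, replace=False)].copy()
    t, t_max = 0, n_epochs * len(data)
    for _ in range(n_epochs):
        for x in data[rng.permutation(len(data))]:
            frac = t / t_max
            step = eps[0] * (eps[1] / eps[0]) ** frac    # decaying learning rate
            width = lam[0] * (lam[1] / lam[0]) ** frac   # decaying neighborhood range
            dist = np.linalg.norm(units - x, axis=1)
            ranks = np.argsort(np.argsort(dist))         # 0 = closest unit
            units += step * np.exp(-ranks / width)[:, None] * (x - units)
            t += 1
    return units

def classify(units, delta):
    """Assign each delta-cepstral frame to its nearest neural-gas unit."""
    d = np.linalg.norm(delta[:, None, :] - units[None, :, :], axis=2)
    return np.argmin(d, axis=1)

def fit_local_models(units, delta, bone, air):
    """Fit one affine map (assumed model form) from bone to air mel-cepstra per class."""
    labels = classify(units, delta)
    models = {}
    for k in range(len(units)):
        idx = labels == k
        if not idx.any():
            continue  # no training frames fell into this class
        X = np.hstack([bone[idx], np.ones((idx.sum(), 1))])
        W, *_ = np.linalg.lstsq(X, air[idx], rcond=None)
        models[k] = W
    return models

def convert(units, models, delta, bone):
    """Convert bone mel-cepstra frame by frame with the model of each frame's class.
    Frames whose class has no model are passed through unchanged."""
    labels = classify(units, delta)
    out = np.array(bone, dtype=float)
    for k, W in models.items():
        idx = labels == k
        if idx.any():
            out[idx] = np.hstack([bone[idx], np.ones((idx.sum(), 1))]) @ W
    return out
```

In this sketch the delta-cepstra only select which local model applies, while the mel-cepstra are what the model actually maps, mirroring the split of roles described in the abstract. Resynthesizing a waveform from the converted mel-cepstra is outside its scope.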
Related resources
Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems
This article examines methods for optimizing the performance of GMM-based voice conversion systems, taking the GMM method as the baseline to be improved. In current methods, because a single conversion function is used to convert all speech units, and because of the spectral smoothing that arises from statistical averaging, we will observe quality ...
A Voice Conversion Method Combining Segmental GMM Mapping with Target Frame Selection
In this paper, a voice conversion approach that combines two distinct ideas is proposed to improve the converted-voice quality. The first idea is to map spectral features, e.g. discrete cepstrum coefficients (DCC), with segmental Gaussian mixture models (GMMs). That is, a single GMM of a large number of mixture components is replaced here with several voice-content specific GMMs each consisting...
Development and Evaluation of Bone-conducted Ultrasonic Hearing-aid Regarding Transmission of Speaker Emotion: Comparison of DSB-TC and DSB-SC Amplitude Modulation Method
Human listeners, even patients with sensorineural hearing loss, can perceive speech signals in a voice-modulated ultrasonic carrier from a bone-conduction stimulator. Considering this fact, we have developed a bone-conducted ultrasonic hearing aid (BCUHA). However, there remains considerable scope for improvement, particularly in terms of sound quality. Voice-modulated BCU is accompanied by a stron...
Cepstrum Based Voice Transformation Using ANN
The basic goal of a voice conversion system is to mimic the characteristics of the target speaker's voice while keeping the linguistic and paralinguistic information intact. The characteristics of a speaker are reflected in speech at different levels, such as vocal tract, excitation, and prosodic parameters. The proposed work is based on the cepstrum, which represents the vocal tract and excitation parameters of the...
Evaluation of cross-language voice conversion based on GMM and straight
Voice conversion is a technique for producing utterances using any target speakers’ voice from a single source speaker’s utterance. In this paper, we apply cross-language voice conversion between Japanese and English to a system based on a Gaussian Mixture Model (GMM) method and STRAIGHT, a high quality vocoder. To investigate the effects of this conversion system across different languages, we...